Background

This report works through a method to identify which stations do not have valid data inside the 2020 IR window. Stations listed as impaired are deemed valid even if they do not have data in a given window (carryover stations). Citizen Monitoring/Non agency stations will be flagged but not dropped from this analysis. Not all citmon/non agency data resides in ‘Conventionals’ (yet), so we cannot be sure what stations definitely need to be dropped.

Data Input

We will start with Cleo’s final stations file for 2020IR.

## Reading layer `2020_wqms' from data source `C:\HardDriveBackup\R\GitHub\IR2022\1.preprocessData\GIS\2020_wqms.shp' using driver `ESRI Shapefile'
## Simple feature collection with 7278 features and 53 fields
## geometry type:  POINT
## dimension:      XY
## bbox:           xmin: -372869.6 ymin: 60001.45 xmax: 368073.1 ymax: 381932.3
## epsg (SRID):    NA
## proj4string:    +proj=lcc +lat_1=37 +lat_2=39.5 +lat_0=36 +lon_0=-79.5 +x_0=0 +y_0=0 +datum=NAD83 +units=m +no_defs

And Roger’s ‘Conventionals’ data pull for the 2020 IR window (2013-2018). We will change the date/time field to an appropriate format for filtering and remove sites that are missing either a latitude or longitude value. Those sites are summarized below.

Now let’s make a dataset of distinct sites sampled in the window by dropping all data in window.

Analysis

The first step is to join on Station ID, but first we need to make a special joining field since some StationID’s use lower case letters.

Now remove the stations that joined with conventionals from our “to do” list.

Monitored vs Assessed

For the stations that joined (i.e. were sampled 2013-2108), we need to add some field to help us out. First we will automatically give these stations a TRUE for the new field MONITOR_STATUS because they were monitored in the 2020 assessment cycle. Next we will assign a field called ASSESSMENT_STATUS to denote whether something is assessed or not. A station is assessed if any of the _STAT fields contain a ‘S’ (or ‘FS’) or ‘IM’ designation.

Here are all the sites that were sampled in 2020 IR window but lacked a S, FS, or IM designations for any parameter. These are generated by filtering table above with is.na(ASSESSMENT_STATUS). Might want to dig into these.

Juicy Stuff

Now to figuring out what is going on with the final2020toDo (stations that didn’t join to data from 2013-2018).

First let’s drop anything not sampled by DEQ bc we just don’t have good ways of organizing that data for now. Those sites are displayed below.

Now let’s run that same ‘ASSESSMENT_STATUS’ analysis on these remaining 1126 sites to see where we may have some lingering problems.

Here are all the sites that were not sampled in 2020 IR window and lacked a S, FS, or IM designations for any parameter. These are generated by filtering table above with is.na(ASSESSMENT_STATUS). Definitely want to dig into these.